Alignment of human prosodic patterns for spoken dialogue systems

Authors

  • Noriko Suzuki
  • Yasuhiro Katagiri
Abstract

An adaptive speech recognizer is a key function in the design of a robust spoken dialogue system. Our research focuses on the human tendency to align prosodically with one’s conversational partners. A spoken dialogue system might be able to exploit this tendency to implicitly influence people to adjust their speech at the prosodic level so that it accommodates the system’s recognition capabilities, which would in turn decrease recognition errors. Prosodic alignment in human-computer interaction has been studied as part of the problem of personality alignment in the context of animated conversational characters. The present study examines the human tendency toward prosodic alignment at a more micro level, and explores whether people’s speech amplitude and pause length align to those of computer-generated voices within a dialogue exchange. We found that people exhibit spontaneous short-term alignment of speech prosody to slight prosodic changes in a computer’s voice within a session, even without the help of animated conversational characters.


Similar resources

Influence of different dialogue situations on user's behavior in spoken corrections

This study analyzes the acoustic-prosodic features observed in spoken corrections in a task-oriented spoken dialogue. While previous studies have found that spoken corrections are often predicted as hyperarticulate speech and related to some acoustic-prosodic events, a close analysis has not been shown in terms of various dialogue situations in practical spoken dialogue systems and tasks. I...


Visualizing Spoken Discourse: Prosodic Form and Discourse Functions of Interruptions

In this paper we show that interruptions are important elements in the interactive character of discourse and in the resolution of issues of cognitive uncertainty and planning. By representing discourse graphically, we also show that interruptions are part of the local and global coherence that is brought about through the systematic phrase-to-phrase prosodic patterns of discourse. The specific...


EXPROS: Tools for exploratory experimentation with prosody

This demo paper presents EXPROS, a toolkit for experimentation with prosody in diphone voices. Although prosodic features play an important role in human-human spoken dialogue, they are largely unexploited in current spoken dialogue systems. The toolkit contains tools for a number of purposes: for example extraction of prosodic features such as pitch, intensity and duration for transplantation ...


Linguistic and Acoustic Changes of U Different Dialogue S

This paper presents the characteristic differences of acoustic and linguistic features observed in different spoken dialogue situations: human-human vs. human-machine interactions. We compare the acoustic and linguistic features of the user’s speech to a spoken dialogue system and to a human-operator in several landmark setting tasks for a car navigation system. It has been pointed out that spe...


/nailon/ – Software for Onlin

This paper presents /nailon/ – a software package for online real-time prosodic analysis that captures a number of prosodic features relevant for interaction control in spoken dialogue systems. The current implementation captures silence durations; voicing, intensity, and pitch; pseudo-syllable durations; and intonation patterns. The paper provides detailed information on how this is achieved. ...




Publication date: 2004